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ABSTRACT 

Important but rare and subtle processes driving galaxy morphology and star-formation 
may be missed by traditional spiral, elliptical, irregular or Sersic bulge/disk classifi¬ 
cations. To overcome this limitation, we use a principal component analysis of non- 
parametric morphological indicators (concentration, asymmetry, Gini coefficient, M 2 0 , 
multi-mode, intensity and deviation) measured at rest-frame B-band (corresponding 
to HST/WFC3 F125W at 1.4 < z < 2) to trace the natural distribution of massive 
(> 10 10 Mq) galaxy morphologies. Principal component analysis (PCA) quantifies the 
correlations between these morphological indicators and determines the relative im¬ 
portance of each. The first three principal components (PCs) capture ^75 per cent 
of the variance inherent to our sample. We interpret the first principal component 
(PC) as bulge strength, the second PC as dominated by concentration and the third 
PC as dominated by asymmetry. Both PCI and PC2 correlate with the visual ap¬ 
pearance of a central bulge and predict galaxy quiescence. PCI is a better predictor 
of quenching than stellar mass, as as good as other structural indicators (Sersic-n or 
compactness). We divide the PCA results into groups using an agglomerative hierar¬ 
chical clustering method. Unlike Sersic, this classification scheme separates compact 
galaxies from larger, smooth proto-elliptical systems, and star-forming disk-dominated 
clumpy galaxies from star-forming bulge-dominated asymmetric galaxies. Distinguish¬ 
ing between these galaxy structural types in a quantitative manner is an important 
step towards understanding the connections between morphology, galaxy assembly 
and star-format ion. 

Key words: galaxies: elliptical and lenticular - methods: statistical - catalogues 
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1 INTRODUCTION 

Massive galaxies today form stars at a lower rate than in 
the past due to many factors. However, we do not have a 
complete accounting of the processes quenching the star- 
formation in galaxies. An increase in the mass/number den¬ 
sities ( Tomczak et al.|2014 van der Wei et al.|2014 1 of mas¬ 
sive, red galaxies implies stars are not forming to the same 
extent they once were. Each of these observations attempt to 
connect of observed color (or star-formation rate) and stel¬ 
lar masses to morphology. The star-formation rate - stellar 
mass (SFR—M„) relationship shows star-forming galaxies 
at z ~ 0 follow a “main sequence” (Brinchmann et al.|2004 


Wuyts et al. 2011). Galaxies on the main sequence are bluer 
and have lower Sersic-indices than galaxies below the rela¬ 
tion. Massive galaxies with low SFRs are red and have high 
Sersic indices and bulge strengths. The SFR— M* morphol¬ 
ogy relation has been shown to hold out to z ~ 2.5 (Wuyts 


et al. 2011). However, bulge strength has been described 


as a “necessary but not sufficient” condition for quenching 
star-formation in z < 2.2 galaxies (Bell et al.|2012L 

If the presence of a bulge is not sufficient to fully quench 
a galaxy, other factors such as size may be important for 
shutting down star-formation. At redshifts a ~ 1.5, galax¬ 
ies of sufficiently high mass and small size are quenched 
(Barro et al7j[2013). This suggests a relationship between 
so-called “compactness” (Si.5 = M/r^' 5 ) and the specific 
star-formation rate (sSFR) . However, the number density 
of these compact galaxies has been decreasing with the age 
of the universe. 

The mechanisms for quenching star-formation and 
transforming the morphology of galaxies are not fully un¬ 
derstood. Proposed mechanisms include: major mergers (e.g. 
Naab et al. |2006 Hopkins et al.||2010 1 ; minor mergers (e.g. 


Hopkins fe Hernquist||2009| |Villforth et al.||2013[ |Taniguchi 


1999); secular processes (for review see jKormendy fe Ken- 
nicutt|2004 Cisternas et al.|2011 l; AGN feedback (e.g. Silk 
& Rees 1998 Schawinski et al.||2006[); and mass quenching 
] Dekel fe Birnboim||2006 Bell et al.||2012 1. Comprehensive 


models of galaxy formation can yield a reasonable link be¬ 


tween galaxy morphology and star formation (e.g. Snyder 
et al.| 2015) but we do not yet have a perfect accounting of 
how all these processes might contribute. 

As a result, two evolutionary tracks have been devel¬ 
oped to explain the disappearance of compact, quenched 
galaxies: (1) major mergers at z ~ 2-3 quickly cause a 
galaxy to quench, which later grow through minor merg¬ 
ers and gas accretion; (2) violent disk instabilities/secular 
processes/minor mergers at z~1.5 cause a slower decline in 
star-formation and simultaneous size growth before the qui¬ 
escent phase. 

To study the processes driving evolution, we need a 
method to effectively and efficiently characterize the struc¬ 
tures and shapes of galaxies. Visual classifications have been 
used since the discovery of galaxies, and have subsequently 


been adapted to fit modern surveys (e.g. Galaxy Zoo, Lin- 
tott et al. 2008 Kartaltepe et al. 20141. Visual classifications 


can find subtle structural elements possibly missed by an au¬ 
tomated routine. However, human classifications of galaxies 


can be very time consuming and subjective. Additionally, 
galaxy structure at high redshift does not always correspond 


to the local Hubble sequence) Bruce et al. 2012 Bell et al. 


2012 Kriek et al. 2009 Lee et al. 20131. Disk-dominated 


galaxies can appear clumpy (Forster Schreiber et al.||2009l 


and bulge-dominated galaxies can be compact, very red and 


massive, but possess no extended envelope (e.g. van Dokkum 
et al.|2008 1. 


GALFIT ( |Peng et al.| 20021 [Mo| ) is an automated tech¬ 
nique often used to classify galaxies that models the light 
profile of galaxies with a Sersic profile Disks have 

exponential light profiles (n=l), while ellipticals are best fit 
by a de Vaucouleurs profile (n=4). GALFIT is sensitive to 
small galaxies, can distinguish overlapping light profiles of 
nearby galaxies, incorporates the point spread function of a 
specific held/detector, and most importantly is easy to inter¬ 
pret. However, GALFIT assumes a symmetric and smooth 
light profile, which at times can be problematic. This as¬ 
sumption does not hold for irregular galaxies, merger rem¬ 
nants, and disk galaxies with bars or clumps. GALFIT is 
typically used to calculate a single Sersic fit to the light pro¬ 
file of the galaxy. Two-component Sersic fits has also been 


used to combine disk and bulge components (e.g. Simard 
|et al.|2011| |Bruce et al.|2014a|b| ); however, calculations can 
be quite CPU intensive, sometimes needing weeks to finish. 

Quantitative non-parametric morphological statistics 
characterize galaxy structure and do not assume an analytic 
light profile. This fact allows us to apply automated char¬ 
acterization to irregular galaxies as well. Examples of non- 
parametric morphological indicators include: concentration 


index ( C , Bershady et al. 2000 Conselice 20031, asymme¬ 
try (A, Conselice et al.|2000l, Gini coefficient (G, Abraham 


et al. 2003| Lotz et ah] 2004 1, M 20 (|Lotz et al~|2004 i, and 
three new statistics from Freeman et al.|(2013|): Multimode 


(M), Intensity (/), and Deviation ( D ). The MID statistics 
have been found to be sensitive to mergers and clumpy star- 
formation, even at high redshift ( Freeman et al.|2013 |. 

However, for many galaxies these statistics can be 
strongly correlated. Moreover, cosmological models of 
galaxy formation yield a picture in which these structures 
can evolve quickly along diverse paths, thereby motivating a 
broad deep classification system ( Snyder et al.|20l4 l. There¬ 
fore we require further analysis to understand the inher¬ 
ent relationships among these statistics, and between these 
statistics and galaxy assembly processes. 

Principal component analysis (PCA) is a simple way to 
reduce the dimensionality, break internal degeneracies and 
find the natural distributions of data in parameter space. 
To eliminate degeneracies inherent in these morphological 
statistics we performed a PCA using 7 non-parametric mor¬ 
phology measurements on 1244 galaxies from 1.36 < z < 
1.97. PCA has been shown to efficiently classify galaxies 
(e.g. Taghizadeh-Popp et al. 2012 the Zurich Estimator of 
Structural Types (ZEST), Scarlata et al.|2007 |. A few stud¬ 
ies immediately capitalized on the ZEST classifications to 


study the number density evolution of disk galaxies ( Sargent 
et al.|[20071, the luminosity function evolution for elliptical 
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galaxy progenitors ( Scarlata et al.|2007 l, and the evolution 
of the galaxy merger rate to 2 ~ 1 ( Kampczyk et al.|2007 |. 

I11 this paper, we use PCA and hierarchical clustering 
to classify galaxies based on their structure. These classi¬ 
fications allow us to characterize galaxies by more subtle 
means than the traditional Hubble sequence scheme. We 
can test the mechanisms which cause galaxies to reassem¬ 
ble and/or influence star-formation by tracking how mor¬ 
phologies change across time. This places vital constraints 
on the physical mechanisms assembling galaxies and quench¬ 
ing star-formation. 

This paper is structured as follows: O details the CAN- 
DELS data set and our sample selection; q3| defines the non- 
parametric morphologic measurements we perform, their as¬ 
sociated error estimates and the principal component analy¬ 
sis as applied to our data set; ^describes the results of our 
PCA, the clustering algorithm and convex hull method used 
for grouping galaxies, a test of the group, and PCA reliabil¬ 
ity and descriptions of the final galaxy groups; E describes 
the general morphological characteristics of the galaxies 
in each group; E details the relationship between stellar 
mass, quenching, and our group classifications. Additionally, 
the disagreement between Sersic and visual classifications 
with PC-based classification (especially for compact/bulge- 
dominated galaxies) is discussed. 

All magnitudes are quoted in the AB system. A stan¬ 
dard ACDM cosmology of Hq = 70 km s -1 Mpc -1 , Hm = 
0.3, and Ha = 0.3 is used throughout this work. 


2 DATA 

The Cosmic Assembly Near-IR Deep Extragalactic Legacy 
Survey (CANDELS, Pis: S. Faber and H. Ferguson; |Grogin| 
et al. 2011 and Koekemoer et al. 20111 observed 5 heav¬ 


120131 are the combination of multiple different photomet¬ 


ily studied fields (of which we use UDS, GOODS-S and 
COSMOS) with the Hubble Space Telescope (HST). High 
resolution imaging by Wide Field Camera 3 (WFC3) in 
near-infrared bands F125W (J) and F160W (H), combined 
with observations from the Advanced Camera for Surveys 
(ACS) in visible bands F606W (V) and F814W (/„) con¬ 
stitute the new measurements in the CANDELS program. 
For the purposes of our study, we initially focus only on the 
F125W WFC3 images. Future work will study the evolu¬ 
tion of galaxy morphology at a consistent rest-frame wave¬ 
lengths. 

We use the CANDELS H-band (F160W) selected multi¬ 
wavelength catalogs (UDS, Galametz et al. 2013) GOODS-S, 
Guo et al. 1 |2013| COSMOS, Nayyeri et al., in prep), photo¬ 
metric redshifts (|Dahlen et al.||2013[), non-parametric mor¬ 


phologies (this work), Sersic parameters (van der Wei et al. 


20121, visual classifications (Kartaltepe et al. 20141, rest- 


frame photometry, and stellar masses (this work). The limit¬ 
ing magnitude for HST/WFC3 F125W and F160W are 27.35 
and 27.45 respectively with FWHM of ~0.135” and ~0.15” 


respectively. Galametz et al. (20131 outlined the techniques 


ric redshift calculating codes and techniques which reduce 
the scatter of photometric redshifts (to cr ~ 0.03, with an 
outlier fraction of 3 percent). Throughout the rest of this 
paper, we use 2 to denote the average photometric redshift 
in these CANDELS catalogs (Mobashe r et al.|2015 |. 

Rest-frame U — V — J colors were calculated by the sed- 
fitting code EAZY ( Brammer et al.|2008 l, using the empir¬ 
ical local galaxy templates of Brown et al. (20141. Stellar 
masses were computed with FAST ( |Kriek et a!.||2009| , as¬ 
suming Bruzual & Chariot (2003 ) delayed exponential star- 


formation histories, a Chabrier (20031 initial mass function, 
|Calzetti et al.| ( |2000| ) dust attenuation, and solar metallici- 
ties. 


2.1 Sample Selection Criteria 

We select bright (H < 24.5), massive (M* > 10 10 Mq) 
galaxies with 1.36 < 2 < 1.97 galaxies measured in F125W 
(J). This band approximately corresponds to rest-frame op¬ 
tical B-band at these redshifts. This redshift range provides 
a large sample of galaxies measured in a constant rest-frame 
waveband, and offers a high enough redshift to have a differ¬ 
ent morphological distribution from a local sample. At this 
redshift and magnitude, the CANDELS surve ys are mass- 
complete down to 10 10 Mq (Wuyts et al. [2011). In our sam¬ 
ple of UDS, COSMOS and GOODS-S there are a total of 
6269 galaxies with H < 24.5 and M* > 10 10 Mq. Of those 
galaxies 1539 are within our redshift range (1.36 < z < 1.97). 

The following affect our sample completeness: high 
signal-to-noise (per pixel) measurements (S/N > 4), an in¬ 
ternal morphology quality flag = 0, and a well measured 
concentration (i.e. C 7I -99) requirement. The quality flag re¬ 
quirement removes objects from the sample with discontigu¬ 
ous segmentation maps resulting from low surface bright¬ 
ness, and/or poor masking of bright neighbors. In Appendix 
[X]we include a brief discussion of galaxies with a quality flag 
= 1. The concentration requirement removes the contami¬ 
nation from poorly measured galaxies on the overall PCA. 
For some galaxies, r^o (and thus C) can not be accurately 
measured because either the object is too small, or there is 


used to create the photometric catalogs. 

The photometric redshift catalogs of |Dahlen et al.| 


a bright point source disrupting the light profile (see f 3.1.21. 
The concentration requirement reduces the total of galaxies 
in the sample to 1482. The FLAG requirement reduces the 
sample to 1250. The signal-to-noise, FLAG and well mea¬ 
sured concentration requirements together reduce our final 
sample to 1244 galaxies. 


3 MORPHOLOGICAL MEASUREMENTS 
3.1 Non-parametric Morphology 

We focus on non-parametric morphology statistics: concen¬ 
tration, asymmetry, Gini coefficient, M 20 , along with three 
new statistics from [Freeman et al.|2013| multi-mode, inten¬ 
sity and deviation. The code for calculating the morpho¬ 


logical statistics (originally developed by Lotz et al. 2008) 
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has been modified to include the new statistics and accom¬ 
modate much larger input images. The code is applied to 
the CANDELS F125W mosaics using the F160W detected 
catalogs and segmentation maps as the input. 


3.1.1 Petrosian Radius 

The Petrosian radius r p is the radius we set to where the 
surface brightness y. is 20 per cent of the the mean interior 
surface brightness (Eq. [l] Petrosian| [l 976[ ). The Petrosian 
radius is more robust to surface brightness dimming than 
isophotal sizes are. We can measure the same physical por¬ 
tions for galaxies at a variety of redshifts (e.g. |Lotz et al. 
20041. 


Due to their uniform morphologies and lack of structure 
elliptical galaxies typically have small asymmetry values (A 
~ 0.02). Meanwhile spiral galaxies usually have values be¬ 
tween A ~ 0.07 to 0.2 (Conselice 20141. This statistic is most 
useful for identifying irregular galaxies because they appear 
lopsided or ragged. Visually inspected merger remnants can 
have A > 0.3 ( Conselice| 2003). The asymmetry statistic is 
more sensitive to gas-rich mergers than to gas-poor or minor 
mergers ( Lotz et al.|2010a|b |. 

If the local background is high and the galaxy is is suf¬ 
ficiently low surface brightness then negative A values are 
measured. This is consistent with measurement errors (see 

p~2 1. 


fj.{r < r v ) 


(1) 


3.1.2 Concentration 


The concentration index (G; Bershady et al.|2000 Conselice 
2003) is the ratio of the circular radius containing 80 per cent 


(rgo) of a galaxy’s light (as measured within 1.5 Petrosian 
radii) to the radius containing 20 per cent (r 2 o) of the light 
(Eq# A large concentration value indicates a majority of 
light is concentrated at the center of the galaxy. Elliptical 
galaxies and bulge-dominated spirals have high concentra¬ 
tion values. However, a spiral or irregular galaxy with diffuse 
light profile and weak/no bulge will have low concentration 
values. 


C = 5 log f — 
\ r 20 


( 2 ) 


For some galaxies ?' 2 o (and thus C) can not be accu¬ 
rately measured because either the object is too small, or 
there is a bright point source disrupting the light profile. 
These galaxies instead have unphysical concentration val¬ 
ues (G < 0) and are not included in the definition of our 
principal components (see 12.1). 


3.1.4 Gini Coefficient 


The Gini coefficient (G; |Lorenz | |1905| jAbraham et al.||2003| 
Lotz et al.|2004 | is a statistic adapted from economics that 
measures the equality of light distribution in a galaxy. The 
Gini coefficient is defined by the Lorenz curve of the galaxy’s 
light distribution, and is not affected by spatial position. 
This implies that only the amount of light distribution mat¬ 
ters, which differentiates the Gini coefficient from the con¬ 
centration statistic ( |Conselice| |20141. 

The pixels are ranked by increasing flux value, then G 
is determined by Eq. [4] where n is the number of pixels in 
the galaxy’s segmentation map, Xi is the pixel flux at the 
rank i pixel and A' is the mean pixel value. 


G = 


1 

Xn(n 



n — l)Xi 


(4) 


A galaxy with equally distributed light will have a Gini 
coefficient approaching 0. Conversely, a galaxy with a large 
fraction of light concentrated on a few pixels will have a 
Gini coefficient closer to 1. Elliptical galaxies and galaxies 
with bright nuclei have high Gini coefficients, while disks 
and galaxies with a uniform surface brightness will have low 
Gini coefficients. 


3.1.3 Asymmetry 


Asymmetry (A; |Conselice et al. 2000) measures the differ¬ 
ence between the image of a galaxy (Ix, y ) and the galaxy 
rotated by 180 degrees (Iiso(x, y )', Eq. [3j). This determines a 
ratio of the amount of light distributed symmetrically to all 
light from the galaxy. A is calculated from a sum of all pix¬ 
els within 1.5 Petrosian radii from the center of the galaxy. 
We then correct by Biso, which is the average asymmetry of 
the background. An initial guess for the center of rotation 
is defined by the physical center, but is updated through 
an iterative process. This process continues until a global 
minimum value for A is found (Conselice et al.|2000). 


A = 


E, 


\t(x,y) Il80(x,y) | 

2 E \hx,y) | 


-B 1 


(3) 


3.1.5 M 20 

The second order moment of the brightest regions of a galaxy 
(M 2 o; |Lotz et a l.]20041 traces the spatial distribution of any 
bright clumps. When used in tandem with the Gini coeffi¬ 
cient, M 20 can be an effective tool for differentiating galax¬ 
ies with bright off-center clumps (such as irregular galaxies) 
from those with one bright central region (such as the bulge 
of a spiral galaxy). We define the regions representing the 
brightest 20 per cent of the galaxy (Eq. [5|, and then cal¬ 
culate the spatial distribution of those pixels as an offset 
from the central pixel. The center is defined as the position 
minimizing Mtot- 


# fi < 0.2/t o t 


(5) 
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Galaxy 17102, Threshold = 0.92 




Figure 1. F125W (AB) = 22.2 CANDELS galaxy image is shown to demonstrate the M, / and D statistics. The left panel shows 
the image of the galaxy outlined by the segmentation map created using our morphology code. The middle panel shows red outlines 
describing the clumps found when calculating the M statistic. The white X displays the location of the brightness distribution peak, 
and the cyan circle represents the location of the intensity centroid used to calculate the D statistic (j |3.1.8ty . The right panel color codes 
the clumps for easy identification. This galaxy is highly disturbed and is broken into 3 bright regions, with the brightness peak well 
separated from the intensity centroid. The threshold value (qi) in this case is 0.92, which represents the threshold where the M statistic 
was maximized. 


Mtot = Yl Mi = f* [( Xi - X A 2 + (yi - vA 2 ] (6) 

i i 

Finally we calculate the second order moment (Eq. [Tj). 


Moo = log 


E jMj 

Mtot 


(7) 


Values for the M 20 statistic are generally between -0.5 
and -2.5. Elliptical galaxies have M 20 closer to -2.5 signifying 
a lack of bright-off center clumps. Meanwhile disk galaxies 
can have M 20 > -1.6 when, for example, bright star-forming 
knots are present. Similar to concentration, M 20 is biased 
low for galaxies where the brightest 20 per cent light is un¬ 
resolved. 


3.1.6 Multi-mode 

The multi-mode (M) statistic is the ratio, in pixels, of the 
two brightest regions of a galaxy (adapted from Freeman 
et al.|[2013 |. Bright regions are determined via a threshold 
method where qi represents the normalized flux value, and 
l per cent of pixel fluxes are less than qi. This creates a new 
binary image gtj where 1 represents fluxes larger than qi 
and 0 represents fluxes less than qi (Eq. [8| . 


value 1 are then sorted in descending order by area. The 2 
largest groups (^4;,( 2 ) and Aqi)) define an area ratio Ri : 


Ri = 


A U 1 ) 


(9) 


The previous two steps are recomputed for various nor¬ 
malized flux levels l, and the M statistic is the maximum 


Ri value (Eq. 101. Values approaching 1 represent multiple 


nuclei, while values near 0 are single nuclei systems. 


M = max Ri 


( 10 ) 


This formulation is slightly revised from|Freeman et al.| 


120131 to limit the M statistic to values between 0 and 1. 
|Freemanet al.| (2013) multiplies Eq.[9]by an additional factor 
°f -Aq 2 ) to limit the effect of hot pixels. However, this adds 
a size dependent factor to the calculation. Because we wish 
to measure M values for galaxies at a variety of angular 
distance scales, it is important to have a size independent 
measure. For illustrative purposes, Fig. [I] shows an example 
of how the MID statistics are calculated. In small galaxies 
that are poorly resolved Ait n is very small (approaching 
zero) and we set M=-99. We have tested the result of setting 
M=-99 values to M=0 but find the PC weights and group 
assignments are very similar to the original values. 


9i,j — 


}i,j ^ qi 
otherwise 


( 8 ) 


The number of pixels in contiguous groups of pixels with 


3.1.7 Intensity 

Intensity (I) is the ratio, in flux, of the two brightest regions 


(Freeman et al. 20131. The galaxy image is first smoothed 
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by a symmetric bivariate Gaussian kernel. Regions are de¬ 
fined using maximum gradient paths, where the surrounding 
eight pixels of every pixel are inspected and the path of max¬ 
imal intensity increase is followed until a local maximum is 
reached. Regions consist of pixels linked to a unifying local 
maximum. The fluxes within these groups are summed and 
sorted into descending order (by total flux) leading to our 
intensity ratio: 


i-p (in 

fii) 

Similar to the M statistic, elliptical galaxies with a 
bright bulge have I ~ 0, while disk galaxies with bright 
clusters of star-formation are more likely to have I values 
approaching 1. 


3.1.8 Deviation 

Deviation ( D ) measures the distance between the intensity 
centroid of a galaxy and the center of the brightest region 

and Eq[# Disk and spheroidal 
galaxies have deviation values near 0 because their central 
bulges typical possess the brightest pixels. On the other 
hand, a high deviation value indicates a galaxy has bright 
star forming knots significantly separated from the intensity 
centroid (e.g. Fig. [T|. 


I Freeman et al.j2013 Eq. 


12 


((Teen, ycen ) 




( 12 ) 


The deviation statistic D is the Euclidean distance (in 
pixels) between the intensity centroid and brightest pixel 
scaled by a crude estimate of a galaxy’s radius based upon 
the number of pixels comprising the galaxy. 



x hi) ) 2 (y° eri 


Vh o ) 2 


(13) 


3.2 Morphological Principal Components 


Principal component analysis (PCA) is a linear transforma¬ 
tion of multivariate data. This defines a set of uncorrelated 
axes, called principal components (PCs), which are ranked 


by the variance they capture (Pearson 1901 Ivezic et al. 


2013). A linear combination of the original data and eigen¬ 


vector solutions (also called weights) project the original 
data onto the PCs. Principal component analysis is a sim¬ 
ple way to reduce the dimensionality and find the natural 
distributions of data in parameter space. PCA is able to 
determine the correlations between the input data and can 
find relationships missed by other means. 

We begin by “whitening” the data, i.e. we subtract the 
mean of each morphological measurement and divide by the 
standard deviation of each feature. By dividing our data by 
feature variance we remove the effects of mixed units. We 
calculate the singular value decomposition (x,-j = V"EV T , 


SVD) of the “whitened” data matrix (xi_/). An SVD decom¬ 
poses the original data into a diagonal matrix (E) contain¬ 
ing eigenvalues (e) and a non-diagonal matrix V containing 
the expansion coefficients (aka weights). The eigenvalues de¬ 
termine how important each principal component is to ex¬ 
plaining the original data set. The eigenvectors are rank or¬ 
dered by their associated eigenvalue. We then project our 
“whitened” data onto our new eigenbasis to calculate the 
principal component scores, which inform us how similar 


are data points to each other ( PCi , Eq. 14). 




(14) 


3 = 1 


Table[l]shows the correlations and importance of differ¬ 
ent statistics across the eigenvector solutions of the principal 
component analysis. The scree value (e 2 /)T)e 2 ) represents 
the amount of variance captured by a single principal com¬ 
ponent. The scree values demonstrate that the first 3 PCs 
account for >75 per cent of the variance in the data. The 
fact that PCI only accounts for 40 per cent of the variance 
shows that more than a single parameter is needed to define 
a galaxy. The error estimates are the result of the scattering 
method described in §4.2| 

PCI is highly dependent upon M, I , D, M 20 and the 
Gini coefficient. We interpret PCI as a “bulge strength” 
indicator given the correlation with G — M 20 and the impor¬ 
tance of the MID statistics. Fig.[2]shows the relationship be¬ 
tween PCI, Sersic index and the Gini-AFzo “bulge strength” 


(Eq. 15 and 161 the vector of correlations between Gini and 
M 20 ; Snyder et al.|2015 |. Galaxies with low PCI values have 
high Sersic indices and high F indicative of strong bulges, 
while galaxies with higher PCI values have progressively 
smaller bulges and more prevalent disc properties (see ^5] 
for more on the physical and visual properties of specific 
groups). We observe two correlations between F and PCI 
which corresponds to different groups of galaxies. Addition¬ 
ally, the two parallel stripes of data seen in Fig. [2] are the 
result of M=-99 outlier values shifting PCI. We have tested 
the result of setting M=-99 values to M=0 and find that 
the PC eigenweights and the group classifications are very 
similar to our original values. 


F = —0.693 M 20 + 4.95 G - 3.85 (15) 


F(G,M 20 ) 


|F| G ^ 0.14M 20 + 0.778 
-|F| G < 0.14A7 2 o + 0.778 


(16) 


PC2 is highly dependent upon concentration, and is 
larger for galaxies with bright centers and extended en¬ 
velopes. PC3 is dominated by asymmetry and is larger 
for disturbed galaxies. The other principal components are 
harder to interpret, but are also less important as evidenced 
by their lower scree values. It is interesting to note PCI de¬ 
fines a bulge strength but is not dependent on concentration 
(Eq# Concentration for very small (r e < 2 kpc), high Ser¬ 
sic (n > 2.5) galaxies is strongly biased down (see Appendix 
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Figure 2. PCI v. Sersic Index and PCI v. the Gini-M 20 bulge 
strength metric ( F , |Snyder et al.|[2015[ >. PCI is anti-correlated 
with Sersic index and the Gini-ilffeo Bulge factor, F, and thus 
low PCI values are indicative of a strong central bulge. Small 
galaxies can have M = -99 which shifts PCI and leads to the 
two parallel stripes. See ^5] for more on how group 6 galaxies are 
different from the remainder of the sample. 


0 - This bias is potentially important for ~14 per cent of 
our sample. 

We performed tests on how PC A results are affected by 
whitening the data set using the interquartile range (IQR) 
statistic instead of a standard deviation. The eigenvectors 
calculated using either whitening method are mainly con¬ 
sistent. However, we chose to use the standard deviation to 
whiten our data because the PC weights are more volatile 
when calculated with an IQR whitened data set. In partic¬ 
ular, the weight in PC3 describing concentration has a vari¬ 
ance nearly nine times larger when calculated for an IQR- 
scaled data set compared to a standard deviation-scaled data 
set. 


4 PCA-MORPHOLOGY GROUP PROPERTIES 
4.1 Defining PCA morphology groups 


Studies using PCA usually only select the top eigenvectors 
to represent the data. However, this is not a requirement 
of the analysis. In our case, the number of variables is not 
very large and thus retaining the entire parameter space 
is not a computationally expensive procedure. We aim to 
reconstruct the full set of galaxy morphological correlations 
at other redshift ranges by using all 7 PC dimensions to 
represent the data set. The correlations from the higher PC 
eigenvectors may be important at different redshifts. When 
the goal of PCA is to cluster data then reducing the number 
of features based on the amount of variance captured is not 
the only option ( Jolliffe||l986 Ben-Hur fe Giiyori]|2003 L In 
these cases more principal components can better recreate 
the original data set. 


The morphologies of galaxies are not inherently dis¬ 
crete, but rather lie on a continuum. However, it is often use¬ 
ful to bin galaxies into discrete morphological groups. Fig. [3] 
shows the distribution of galaxies when projected onto the 
first three principal axes. Except for a large distinct cluster 
of data points, most of our sample are not well separated, 
requiring the need for an objective data dependent grouping 
method. 


To classify galaxies in distinct groups, we employ 
the Ward hierarchical agglomerative clustering routine of 
scikit-learn ( |Pedregosa et al.~[20 11). Hierarchical cluster¬ 
ing (specifically agglomerative clustering) treats each galaxy 
as its own cluster, which are then merged with nearby 
clusters while minimally increasing the in-cluster variance. 
Mergers of adjacent clusters continue until the desired num¬ 
ber of groupings are attained. We define 10 groups, 2 of 
which are very sparsely populated, with only a combined 12 
galaxies. The sparsely populated clusters consist of extreme 
outliers from the other 8 clusters. For this reason, we group 
all outliers into a single cluster. 


Fig.[4]shows the amount of between-cluster variance cal¬ 
culated for various numbers of clusters. Typically, the opti¬ 
mal number of clusters chosen corresponds to the turnover in 
this distribution (the point where the increase in between- 
cluster variance begins to slow; Everitt & Hothorn 2006 1 
which occurs at ~10 clusters. Increasing the number of clus¬ 
ters any further does not provide any more discriminatory 
power and only complicates interpretations of the final re¬ 
sults. We must note that there is no definitive criterion to 
help define how many clusters are to be defined in the data 
set. 


The hierarchical clustering algorithm defines the groups 
based on the distribution of the data. In order to repro¬ 
duce the same group definitions for new objects with poten¬ 
tially different distributions (e.g. different redshifts), we use 
a convex hull method to define the original group bound¬ 
aries in principal component space. A convex hull defines 
the smallest area containing a set of points. We define con¬ 
vex hulls using the 10 clusters determined by Ward’s method 
for our z ~ 1.5 galaxy sample. In practice we disregard the 
2 sparsely populated clusters and instead group all of those 
galaxies into the outlier class. 

We use all 7 PCs to define the convex hull. Calculat¬ 
ing convex hulls in 7-dimensional space is computationally 
intensive and currently impossible for large data sets, thus 
we outline a simple workaround. We define a convex hull 
based on 2 PC dimensions at a time and test whether a 
galaxy falls within the boundaries of a group using all com¬ 
binations of 2 PC dimensions. The group a galaxy falls in 
the most times is determined to be its group. If more than 
one group is equally likely, the smallest distance from the 
galaxy’s position in PC space to the center of the possible 
groups is used to determine group membership. Galaxies 
that are misclassified following the convex hull method gen¬ 
erally exist on the boundaries of a convex hull. We present 
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Table 1. PC Weights with error estimates based on a bootstrap scattering method 


Parameter 



PCI 



PC2 



PC3 



PC4 



PC5 



PC6 



PC7 

Scree value 



0.41 



0.19 



0.15 



0.08 



0.06 



0.06 



0.05 

Concentration 

-0.06 

± 

0.02 

0.74 

± 

0.01 

-0.35 

± 

0.03 

0.19 

± 

0.04 

-0.31 

± 

0.11 

0.03 

± 

0.12 

-0.43 

± 

0.07 

M 20 

0.48 

± 

<0.01 

-0.03 

± 

0.02 

-0.12 

± 

0.02 

0.16 

± 

0.07 

-0.67 

± 

0.19 

0.07 

± 

0.19 

0.52 

± 

0.09 

Gini 

-0.45 

± 

0.01 

0.27 

± 

0.02 

0.12 

± 

0.02 

0.45 

± 

0.05 

0.11 

± 

0.16 

-0.46 

± 

0.13 

0.53 

± 

0.07 

Asymmetry 

0.00 

± 

<0.01 

0.41 

± 

0.03 

0.82 

± 

0.02 

-0.31 

± 

0.03 

-0.18 

± 

0.05 

0.18 

± 

0.05 

0.06 

± 

0.03 

Multi-mode 

0.38 

± 

<0.01 

0.45 

± 

0.02 

-0.27 

± 

0.02 

-0.30 

± 

0.07 

0.56 

± 

0.11 

0.14 

± 

0.15 

0.40 

± 

0.07 

Intensity 

0.49 

± 

<0.01 

0.04 

± 

0.01 

0.13 

± 

0.01 

-0.13 

± 

0.03 

0.02 

± 

0.10 

-0.82 

± 

0.15 

-0.24 

± 

0.07 

Deviation 

0.43 

± 

<0.01 

0.00 

± 

0.01 

0.30 

± 

0.01 

0.73 

± 

0.04 

0.31 

± 

0.15 

0.25 

± 

0.10 

-0.18 

± 

0.06 


F125W, 1.36<z<1.97 



Figure 3. PCI v. PC2 v. PC3 for our sample of M * > 10 10 Mq, 1.36 < z < 1.97 galaxies, color-coded by their hierarchical cluster 
definitions. PCI anti-correlates with bulge strength, PC2 is dominated by concentration, and PC3 is dominated by asymmetry (see Table 
[T|. Group -1 galaxies (black stars) are outliers from remaining groups, initially they comprised groups 3 and 7. 
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Std Dev Scaled PCA (All PCs) 



Figure 4. The amount of between-cluster variance as a func¬ 
tion of the number of clusters grouped by the Ward hierarchical 
agglomerative clustering routine. The between-cluster variance is 
the sum of the distances from the centroid of each cluster to the 
centroid of all the data. Eventually this value grows to the to¬ 
tal variance in the data when the number of clusters equals the 
number of data points. 


the python code determining the group membership based 
on convex hull grouping^ 


4.2 Morphological Error Estimation 


The Hubble Ultra Deep Field (UDF) consists of deep imag¬ 
ing on a portion of the shallower GOODS-S field ( |Koeke-| 
moer et al.||2011 i. We measure the same galaxies using dif¬ 
ferent depth images to the determine reliability of morpho¬ 
logical measurements as a function of signal-to-noise and 
magnitude. The non-parametric morphologies of galaxies are 
measured both in the deep UDF region and the GOODS-S 
observations. We calculate the differences of GOODS-S mor¬ 
phologies from UDF morphologies. We then bin galaxies in 
magnitude (or surface brightness) to find the average differ¬ 
ence and median absolute deviation, which we define as the 
error for that morphological measurement. 

Fig. shows that larger and brighter galaxies are (un¬ 
surprisingly) well measured morphologically. The median 
absolute deviations (red error bars) show a majority of 
galaxies have statistics that do not vary widely between shal¬ 
low and deep images. In general, the morphological offsets 
seen in Fig. [ 5 ] are very small. (For similar study see Fig. 6 
Grogin et al.|2011 


Now that we have calculated the principal morpholog¬ 
ical components and resulting morphology groups, we can 
test their robustness to measurement errors. We use Monte 
Carlo resampling test to randomly scatter our initial mor¬ 
phological measurements by Gaussians with sigma equal to 
the median absolute deviation for each morphological mea¬ 
surement (Fig. [5j| . We then perform a principal component 


1 https://github.com/mikepeth/PyML 


analysis for this new data set and repeat this process 250 
times. We project the scattered data on the original PC 
weights and then classify the galaxies based on the origi¬ 
nally defined convex hulls ( |4.1[ ) each time. The group with 
a plurality of the reclassifications is defined to be the “Monte 
Carlo” (MC) group. Fig. [bjcan thus be seen as the probabil¬ 
ity distribution function for a galaxy of a certain group to be 
classified into a group via the convex hull method. Group 6 
is the most robustly classified group. Only group 4 galaxies 
are re-classified as such following the Monte Carlo scattering 
to less than a majority of times (however still a large plural¬ 
ity of times). The plots are separated by group and ordered 
roughly by PCI horizontally and PC2 vertically. The largest 
PCI values and smallest PC2 values are in group 6 galaxies. 

Table [l] shows that the most important principal com¬ 
ponents (PCI-3) have typical resampled deviations ^ 10 - 
15 per cent of their weights. Higher principal component 
dimension display greater variability, but are also less im¬ 
portant to our group classifications. 

Every galaxy has a MC reclassification with a proba¬ 
bility associated with it and the group with a plurality of 
the reclassifications is defined as the MC resampled group. 
Regardless of the probability, the reclassification is either 
the same or different from the original group designation. 
This similarity or difference in classification determines the 
completeness and purity of the classification scheme. The 
MC resampled classifications are 90.8 per cent complete and 
90.4 per cent pure relative to original group classifications. 
The completeness and purity are highest when all 7 PCs are 
used to define the groups instead of only 3 PCs. Represent¬ 
ing the data set with 3 PCs slightly drops the completeness 
and purity scores to 88.3 and 89.4 per cent. In contrast, the 
completeness and purity values significantly drop to 25.9 
and 20.3 percent when PCs are calculated from IQR-scaled 
data. The volatility of the eigenvectors calculated from an 
IQR-scaled data set is the cause of these poor reclassifica¬ 
tion results. This evidence leads to our conclusion that using 
all 7 standard-deviation scaled PC eigenvectors will result 
in more definitive groups. 

Note that this does not include the systematic biases 
e.g. those due to the PSF. This bias likely important for 
groups 6 and 0. 


5 PCA MORPHOLOGY GROUPS AT Z ~ 1.5 


The connection between morphology and star-formation has 


been well studied (Wuyts et al. 2011 Kriek et al. 2009 


Brinchmann et al. 2004). Late-type galaxies are typically 


still actively forming stars, whereas early-type galaxies have 
had their star-formation quenched. However, there are ex¬ 
amples of red, quenched disks and blue, star-forming ellip¬ 
ticals which are important rare “transitional” classes. 

We use a UV J color-color diagram (Fig. [7| to clas¬ 
sify galaxies as “star-forming” and “quenched” using the 
bimodality of these two types of galaxies seen in U — V and 


V — J rest-frame colors (Labbe et al. 2005 Wuyts et al. 


2007 Williams et al. 20091. Star-forming galaxies follow a 
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(a) Magnitude vs A (GOODS - UDF) 



-0.4 - 

_,_l_I___3_i_____:_____._ 

24 22 20 

Surface Brightness (F125W AB mag/sq. arcsec) 



24 22 20 

Surface Brightness (F125W AB mag/sq. arcsec) 


(b) Surface Brightness vs A (GOODS - UDF) 

Figure 5. Magnitude and Surface Brightness vs. A[GOODS - UDF] morphological statistics as measured in wide-held imaging of 
GOODS-S compared to the deep imaging of UDF. Red error bars represent the median A morphology value binned in magnitude (or 
surface brightness) bins of 0.5. Error bars represent the median absolute deviation of each bin corresponding to a lcr deviation. 
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Figure 6. Group classification uncertainty, based on bootstrapped morphology measurement errors. Each galaxy’s non-parametric 
morphologies are randomly scattered based on gaussians with widths based on errors found in Fig. [5] The principal components and 
group membership are redetermined 250 times. The resulting MC group distributions for each originally defined group are shown. Groups 
1, 6, and -1 are the most robust to measurement errors, whereas half of Groups 2, 4, 5 and 8 galaxies are scattered into other groups. 
The panels are roughly arranged by PCI (increasing left to right) and PC2 (increasing bottom to top, except for group -1). 


sequence determined by dust extinction. The panels are ar¬ 
ranged in Fig. [7] so that PCI increases along the x-axis and 
PC2 increases along the y-axis. Most groups are primar¬ 
ily comprised of star-forming galaxies. Groups with lower 
PCI values are more compact and quenched. Similarly, a 
UV —Mass diagram separate star-forming from quenched 
galaxies (Fig. [8|. Again galaxies with lower PCI values are 
more massive and more quenched. 


Previous studies (e.g. |Lotz et ah 2004| Conselice 


et al. 2000 Lee et al. 2013| l utilize G — M 20 (Fig. |9j) or 


Concentration-Asymmetry (Fig. 101 diagrams to classify 
galaxies into early and late-type categories. In our study we 
use these tools to reinforce how effective our PCA groups are 
at separating different classes of galaxies. In Fig.[9]the dotted 
lines signify classification regions adapted frorn Lotz et al.| 
(2004) for z~l-2 galaxies observed by HST. Mergers are in 
the upper left region, late-type galaxies are in the lower re¬ 


gion and early-type galaxies are in the wedge-shaped region 
on the rightmost portion of the G — M 20 diagram. C — A 
diagrams (for review see Conselicef2 0141 have been used to 
differentiate giant ellipticals (which live in regions of large C 
and small A) from spirals (with progressively smaller C and 
larger A) and from mergers (which are the most asymmetric 
but the least centrally concentrated). 


For our group descriptions in the following sections we 
will refer heavily to Fig.ffi- mi the example galaxies of Fig. 
|13| - [21] and Tables s- [5] For these figures the locations 
of each subplot represents the approximate position of that 
group in PCA space. From left to right, PCI increases which 
is indicative of an increase in bulge strength. From bottom 
to top, PC2 increases thus concentration increases. 


Tables [ 2 J 5 ] describe the group demographics in terms of 
stellar mass, visual classification (Kartaltepe et al. 20141, 
Sersic indices (van der Wei et al.||2012l and quenched frac- 


© 2015 RAS, MNRAS 000. [T][Ml 














































u-v 


12 M. Peth et al. 



Figure 7. Rest-frame UV J diagram for M» > 10 10 Mg, 1.36 < z < 1.97 galaxies for each group. A UV J diagram is used to separate 
quenched galaxies from star-forming galaxies ( |Williams et al.|2009| l. Quenched galaxies reside in the upper left trapezoid. Star-forming 
galaxies follow a sequence of increasing dust from the bottom left to the upper right. The panels are roughly arranged by PCI (increasing 
left to right) and PC2 (increasing bottom to top, except for group -1). The majority of quenched galaxies are in group 6, with some 
quenched galaxies found in groups 0 and 8. As PCI increases we observe a decrease in the fraction of quenched galaxies. 


tion. These demographics are both listed in terms of the 
original group (as determined by the hierarchical clustering 
method, left columns) and in terms of the MC group (de¬ 
termined using the scattering method, right columns). The 
agreement between the galaxy demographics in the original 
groups and scattered MC groups shows the group character¬ 
istics are quite robust to noise. Table[2]shows that high PCI 
(disk-dominated) groups have very few high mass galax¬ 
ies. Meanwhile, low PCI (compact/bulge-dominated) groups 
have a larger fraction of high mass galaxies. 


We use the CANDELS visual classifications (Kartal- 
tepe et aL|2014l to determine the demography of the PCA 


groups in disk, spheroidal and irregular galaxy classes. For 
a galaxy to be counted as a “disk”, “spheroid” or “irregu¬ 
lar” it must have been classified by at least two-thirds of 
the classifiers as such, and less than one-third as the other 


classes. A “disk+spheroid” is classified as both a disk and a 
spheroid by at least two-thirds of the classifiers. The “other” 
class represents everything that does not belong to the other 
4 categories. The fractions of galaxies in each morphological 
type are shown in Table [3] 

Sersic fits have been used extensively to classify galaxies 
into early- and late-type categories (van der Wei et al.|2012 


van Dokkum et al.|2010| Patel et al. 2011||Peng et al.|2 002). 

Typically, n=2.5 is used to divide late-type (n < 2.5) and 
early-type (n > 2.5) galaxies. Table [4] shows the percent¬ 
age of galaxies representing a certain classification for each 
group as a percentage of the group population ( |van der Wel| 
et al. |[20~i~2 i . Similar to visual classification, the percentage 
of galaxies with disk-dominated morphologies decreases with 
decreasing PCI values. 

Tableland Fig.[7]show that in this redshift range (1.36 
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Figure 8. Rest-frame U — V vs. Stellar Mass diagram for M* > 10 10 Mg, 1.36 < z < 1.97 galaxies for each cluster group. Galaxies 
classified by UV J as star-forming (stars) and quenched (circles) are shown for each group. The dashed line in U — V represents the 
approximate dividing line between quenched and star-forming galaxies. Groups 6, 0, and 9 have the greatest fractions of galaxies with 
large masses (dashed line, M* > 5 X 10 1() Mq). The panels are roughly arranged by PCI (increasing left to right) and PC2 (increasing 
bottom to top, except for group -1). 


< z < 1.97) and mass range (> 10 10 Mg) only 23 per cent 
of galaxies are quenched. Table [5] shows that the quenched 
fraction for a group is anti-correlated to PCI and PC2. 

shows the effective radii (kpc) - stellar mass 


Fig- 


11 


relation for each group. In this figure, PCI and PC2 are 
strongly correlated a galaxy’s compactness. Group 6 galaxies 
are by far the most compact, with the largest fraction of 
quenched galaxies. As PCI and PC2 increase the number of 
quenched galaxies in each group decreases. 


Group 6 

Constituting 37 per cent of the entire sample, group 6 is by 
far the most populated group at z ~ 1.5 (example postage 
stamps in Fig. 131 . Group 6 galaxies are characterized by 
their compact sizes (r e ~ 1.57 ± 0.81 kpc) and smooth fea¬ 
tures. Many of these galaxies are barely resolved by HST 


WFC3 which leads to their structureless appearance. There¬ 
fore, the structural properties of this group should be in¬ 
terpreted with caution, since it is possible that unresolved 
features in these galaxies would cause them to be classified 
as a different group if we had access to higher resolution 
observations. 43 per cent of the group is quenched, which 
represents 72 per cent of all quenched galaxies at this red- 
shift. Groups 0 and 4 are the only other group with a >10 
per cent fraction of quenched galaxies. 

Group 6 galaxies also dominate the high mass galaxies 
at this epoch, constituting 48 per cent of galaxies with 5 x 
10 10 Mq < A-L < 10 11 A/g and 49 per cent of galaxies with 
M* > lO n M 0 . 

Group 6 galaxies have low concentrations (C ~ 3.03 
± 0.40), moderate Gini coefficients (G ~ 0.53 ± 0.05), low 
M 20 (~ -1.67 ± 0.17), extremely low MID values (I ~ 0.00 
± 0.02, D ~ 0.06 ± 0.04), and low asymmetry values (A ~ 
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m 2Q m ;0 m 20 

Figure 9. G — M 20 for each group. Overplotted are the dividing lines between: mergers (top left corner ), bulge-dominated ( right-most 
region ), and disk-dominated (bottom left region) modified from |Lotz et a l. (2004). Group 0 fully occupies the bulge-dominated region of 
the plot. Symbols same as Fig. [8] The panels are roughly arranged by PCI (increasing left to right) and PC2 (increasing bottom to top, 
except for group -1). 


0.05 ± 0.06). The G — M 20 diagram classifies the majority of 
these galaxies as borderline disk/spheroidal (with occasional 
irregular classification). However, M 20 values are potentially 
biased because the 20 per cent light is not resolved. These 
galaxies have large average Sersic indices (h ~ 3.11). 

Group 6 is comprised of the highest percentage of vi¬ 
sually identified spheroids (52 percent) and disk+spheroids 
(26 percent), and also has the lowest percentage of disks (13 
percent) of any group. 

Upwards of 26 per cent of group 6 galaxies are small 
(r e < 2 kpc) with high Sersic (n > 2.5) which could re¬ 
sult in an underestimation of concentration and PC2 values. 
These galaxies would instead be classified into group 0. See 
Appendix B for more discussion. 


Group 0 

Group 0 galaxies are characterized by a strong bulge compo¬ 
nent which is surrounded by a faint smooth extended com¬ 
ponent (example postage stamps in Fig. 14 1 . A significant 
fraction of group 0 galaxies are quenched galaxies (26 per¬ 
cent; Table [5|. Although group 0 galaxies make up only 13 


per cent of the galaxies in the sample, they constitute 38 per 
cent of the galaxies more massive than 10 11 Mq (Table [ 2 J. 


These galaxies have high concentration values (C ~ 3.80 
± 0.78), low M 20 (~ -1.80 ± 0.17), high Gini coefficients 
(G ~ 0.55 ± 0.04), low deviations (D ~ 0.06 ± 0.04), low 
multi-modes (M ~ 0.03 ± 0.04), low intensities (I ~ 0.03 ± 
0.04) and low asymmetries (A ~ 0.06 ± 0.07). This group 
of galaxies is the only class to fall almost entirely into the 
spheroidal region of the G — M 20 diagram. 


Visually, these galaxies have a large disk+spheroid frac¬ 
tion (33 percent), a large spheroid fraction (35 percent), a 
moderate disk fraction (31 percent) and a very low irreg¬ 
ular fraction (1 percent). Parametric fits find that group 0 
galaxies have moderately sized effective radii (r e ~ 3.13 ± 
1.97 kpc) and large average Sersic indices (n ~ 3.87). The 
visual classifications and distribution of Sersic indices agree 
with G — M 20 measurements and thus describe the proto¬ 
typical group 0 galaxy as bulge-dominated with a faint disk 
component or extended envelope. 
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Figure 10. Concentration - Asymmetry for each group. Plotting symbols same as Fig. [8] Groups 9 and 1 have the highest asymmetry, 
while group 0 has the highest concentration. The panels are roughly arranged by PCI (increasing left to right) and PC2 (increasing 
bottom to top, except for group -1). 


Group 9 

Group 9 is characterized by their asymmetric, irregular mor¬ 
phologies and strong bulge component (example postage 
stamps in Fig. |15| ). These galaxies make up a significant por¬ 
tion of the M* > 10 11 Mq galaxies (15 percent). However, 
most of these galaxies are lower mass (A/* < 3 x 10 10 Mq). 
Only 8 per cent of group 9 galaxies are quenched. 

These galaxies have moderate concentrations (C ~ 3.70 
± 0.70), moderate Gini coefficient (G ~ 0.52 ± 0.05), mod¬ 
erate M 20 (~ -1.40 ± 0.27), moderate MID values (M ~ 
0.14 ± 0.14, I ~ 0.21 ± 0.18, D ~ 0.19 ± 0.09) and hi g h 
asymmetry (A ~ 0.21 ± 0.10). These galaxies lie along the 
G — M 20 merger/disk galaxy dividing line and also overlap 
with the spheroidal region. 

Group 9 galaxies have large radii (r e ~ 3.67 ± 1.64 kpc) 
and moderately low average Sersic indices (n ~ 2.11). 

This group is the most visually irregular group (24 per¬ 
cent), and has a relatively low disk fraction (41 percent), 
spheroid fraction (13 percent) and disk+spheroid fraction 
(11 percent). These statistics and visual classifications im¬ 


ply many galaxies have bright off-center clusters, in addition 
to bright central bulges. 

Group 4 

Group 4 galaxies are low-mass, smooth, extended galax¬ 
ies with moderate central concentrations (example postage 
stamps in Fig. 1 16[ >. Although mostly star-forming, group 4 
contains some quenched galaxies (~11 percent). Some galax¬ 
ies are extended and also quenched; meaning they are rare 
“red disk” population. None of the group 4 galaxies are more 
massive than M * > 10 n MQ. Primarily these galaxies are 
lower mass (M* < 3 x 10 1o Mq). 

Group 4 has moderate concentrations (C ~ 3.53 ± 
0.66), moderate Gini coefficients (G ~ 0.49 ± 0.04 ), high 
M 20 (~ -1.11 ± 0.24), low intensities (I ~ 0.05 ± 0.05), 
small multi-mode values (M ~ 0.07± 0.07) , low deviations 
( D ~ 0.10 ± 0.07), and low asymmetry (A < 0). 

Group 4 galaxies have moderate effective radii (r e ~ 
3.13 ± 1.63 kpc) and medium average Sersic indices (n ~ 
2 . 68 ). 
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Figure 11. Effective radii (kpc, as measured in WFC3 F160W by |van der We]_et_al. 2 012] > vs. stellar mass for each group. Dotted lines 
represent the “compact” criteria (M/r 1 ' 5 < 10.3Mq kpc -1 ' 5 ) of |Barro et al.| ( |2013[ )■ Almost all group 6 galaxies are very compact, with 
most galaxies smaller than 2 kpc. Groups 0 and 8 have a number of borderline compact galaxies. The remaining groups have only a few 
compact galaxies at most. The panels are roughly arranged by PCI (increasing left to right) and PC2 (increasing bottom to top, except 
for group -1). 


Group 4 members are primarily visually classified as 
disks (51 percent) or disk+spheroids (24 percent) and are 
less classified as spheroids (17 percent) or irregulars (0 per¬ 
cent). 


Group 8 is dominated by visually-classified disks (74 
percent) with only a modest fraction of spheroids (10 per¬ 
cent). A small number of galaxies are quenched and compact 
which overlaps with groups 0 and 6. 


Group 8 


Group 1 


Group 8 galaxies are an interesting class of bulge+disk sys¬ 
tems with dominant and smooth disks (example postage 


stamps in Fig. 171. This class is dominated by low-mass star¬ 
forming galaxies, but also includes low-mass (< 3 x IO^’Mq) 
quenched galaxies (~ 8 percent). Very few galaxies have stel¬ 
lar masses > 5 x 10 10 Mq. 

Group 8 galaxies have small concentrations (G ~ 3.05 
± 0.43), moderate Gini coefficients (G ~ 0.46 ± 0.03), mod¬ 
erate M 20 (~ -1.56 ± 0.17), low but non-zero MID values 
(M ~ 0.06 ± 0.06, / ~ 0.10 ± 0.11, D ~ 0.09 ± 0.05), and 
low asymmetry values ( A ~ 0.08 ± 0.07). On the G — M 20 
diagram these galaxies fall within the disk-dominated region 
but are close to the spheroidal/disk dividing line. 

Sersic fits to this class find moderate sizes ( r e ~ 3.48 ± 
1.89 kpc) and low average Sersic indices (n ~ 1.46). 


Group 1 galaxies are primarily large disks and irregulars 
with bright off-center star-forming knots (example postage 


stamps in Fig. 181. None of these galaxies are quenched 


based on their UVJ colors. The distribution of masses is 
heavily weighted towards lower mass galaxies with very few 
objects more massive than 3xl0 10 Mq. 

Group 1 galaxies have low concentration values (G ~ 
2.76 ± 0.40), low Gini coefficients (G ~ 0.43 ± 0.04), high 
M 20 (~ -1.07 ± 0.17), moderately high asymmetry values 
(A ~ 0.13 ± 0.11), large multi-mode values (M ~ 0.40 ± 
0.27), high deviations (D ~ 0.37 ± 0.13) and large intensities 
(I ~ 0.61 ± 0.24). The high A and MID statistics indicate 
many of these galaxies have bright off-center clusters and 
are potentially irregular. 

The visual classifications and Sersic indices primarily 
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Table 2. Group percentages by mass range for both original group and “MC Group” 



Group 

10.0 < log M, < 10.5 

10.5 < logM* < 11.0 

log 

> 11.0 

Group Percentage 


6 

49.it!;? 

5i-ot!;! 

45.2t|;S 

43.8+;? 

5.6+'? 

5-1+1 

37.3 (462) 

31.1 (385.2) 

Low PCI 

0 

46.it!;! 

42.it!;? 

41-8ti1 

45.4t|;g 

I2.it?;! 

12.4tl 8 7 

13.3 (165) 

7.0 (87.2) 


9 

71-8++ 

65.7+;? 

23.lt!® 

29-4tt! 

5-lt 3 2 9 5 

4 9”*~ 2 ’2 

1.7 

6.3 (78) 

13.4 (165.4) 



50.9+;? 

53.6+.? 

42.0tg;? 

40.3ti;J 

7 -| +0.3 

0 1+0.5 
°- 1 -0.4 



Mid PCI 

4 

76.6+'| 

70. it y.i 

23.4+1 

28.2t|;g 

u>u -0.4 

i 7 + 2 - 4 

-1.3 

9.0 (111) 

8.4 (103.7) 

8 

70.9 +;? 

70.6+;? 

26.9+1 

27.5+1 

o 9 + 2.1 

z,z -1.3 

i q+ 2 -3 
1 * y —1.3 

10.8 (134) 

9.2 (114.0) 



79 C + 0.9 

<o. 0 _o 4 

70.4+;? 

OK O+0.8 

zo -°-0.3 

27.8+1 

1 2”*" 0 ' 7 
■ L,z — 0.2 

i-s+l 




1 

66.3t|;g 

66.it!;! 

33.7+'? 

31.9+1 

O.oto.i 

2 -ot?;g 

7.7 (95) 

11.4 (140.9) 

High PCI 

2 

74.3++ 

67.3+'! 

25.7t!;| 

30.7+1 

o.ot?;? 

2 -ot?;l 

5.6 (70) 

9.3 (115.4) 


5 

76.6t|;S 

64.2+;? 

23.4+? 

32.5t|;? 

o.otj;? 

3 . 2 +;? 

8.6 (107) 

9.2 (114.1) 



72 4+ 9-8 

' z -^-0.4 

65.9+;? 

27.6tg;g 

31.7t?i 

o.ot!;? 

9 4+0.7 
Z - 4 -0.4 



Outliers 

-1 

co 0+24.1 

oo.o_ 18t0 

50.2t?!j 

4i-2t?i:l 

37.4+?;? 

0.0t?°9 6 

19 r + 18.9 
1Z, ° —10.2 

1.4 (17) 

1.0 (12.9) 

N Galaxies 


746 

440 

58 

1244 


Note: The left hand columns for each mass range represent the demographics based upon the original group based on hierarchical 
clustering. The right hand columns are based on the total group probabilities based on the scattering technique classifications. 


Table 3. Demographics of Visual Classifications of Groups 



Group 

Disks 

Spheroids 

Irregulars 

D+Sph 

Other 


6 

12.5+1 

19.0tg;| 

52.5+;? 

47 4+ 0 - 9 
4/,4 -o.5 

1 o +0 - 9 
i - U -0.6 

1 6 +0 - 7 
0.2 

25.6+;! 

23.7+1 

0 c+ 1-9 
°-°-1.6 

q q+0.7 

Low PCI 

0 

si.ot?;? 

33.8+1 

34.5+;? 

31.8+1 

0 9+ 20 
u,y -1.0 

1 q+8-0 
i '°-0.8 

32.7+;? 

31.5+'? 

q 9+2.0 
u ' y -1.0 

1 1+3-0 
i - 1 -0.8 


9 

41 2”*" 10,6 
4i,z — 8.6 

42.9+'.! 

13.7+i 

23.8+;? 

90 c+8.6 
zo -°-6.6 

10 7+ 1 - 8 
' —0.6 

11.8+;? 

15 9” 1 ” 1 ' 8 
lo>y -0.6 

9.8+; 4 3 

Q + l- 8 
b - 8 -0.5 



20.0+;? 

27.1+3 

43.9+j 

39.3+;? 

3-4+;? 

0 q+0.7 
0.5 

25.8+;? 

22.9+'? 

6.8+;? 

6 9+ 0,9 
o,y -0.6 

Mid PCI 

4 

5 1 - 4 1|1 

49.7+1 

17.1+;! 

20.8+;? 

°.ot?;f 

2 4+ 2 - 8 
Z '^—0.8 

24.3+;? 

21 0 +2 ' 9 
Z1-U —0.9 

7 l+ 4 -6 
‘ — 3.1 

6 1+2-8 
°- 1 -0.8 

8 

73.8t?° 0 1 

58.5+;? 

9.5+;? 

19.2+;? 

3.6+3 

9 q+ 2 - 5 
z,y —0.7 

6.0+;! 

12 9+ 2 - 6 
lz - y —0.8 

7 1+ 4 - 1 

‘ • 1 - 2.9 

6 6" 1 " 2 ' 6 
°-°-0.8 



63-6+! 

54.3+1 

13 0" 1 " 1 ' 2 

J-O.U-o 4 

i9.9tl;? 

1 Q + 1 - 2 

±,y -0.3 

9 7+2.O 
z ‘ —1.2 

14 3"*" 1 ' 2 

i6.7t|;l 

7.1+1 

g q + 2.6 
d, °-1.8 


1 

79 1+11.3 
' Z ' — 9.8 

64 2”*~ 2 ' 2 

°^- z - 0.9 

1 c+3.3 
—1.6 

11 6 +1 ’ 9 

1 —0.6 

16.2+? 

8 9 +1-9 
o,y -0.6 

1 c+3.3 
a,0 -1.6 

9 Q+ 1 - 9 
y ’ u -0.6 

8 . 8 +;! 

6 4 +1,9 
D -^- 0.6 

High PCI 

2 

76.3+?;? 

56.0+? 

7 g + 7 - 4 
* ,y -4.6 

01 q+2.6 

ZL -°-0.9 

5.3+1 

0 7+2.5 
' — 0.7 

7 q + 7 - 4 
' ,y -4.6 

12 5 + 2 - 5 
±z -°- 0.8 

2 g+ 5 - 9 
z ‘°- 2.9 

6 l + 2,5 
d,1 -0.7 


5 

95.5t!?;! 

6i.5+;? 

0 0~*~ 2 ' 7 
U-U - 0.8 

i7.6+;| 

3-0+;! 

4 0+2-5 

0 0”*” 2 ' 7 
U,U - 0.8 

12 5 + 2 - 5 
±z -°- 0.8 

1 5+3- 4 
i -°-1.6 

4-1+;? 



82-0+1 

60.9+;? 

2 0+11 

z '° —0.3 

16.5+;? 

8 -7+'i 

5-9+3 

2 3"*" 11 
Z '° —0.3 

ii-itl:! 

4 7+1-1 
^•'-0.3 

5.6ti;? 

Outliers 

-1 

40.0t?!;l 

99 1 +21.9 
OZ- —6.3 

30 . 0 ++ 

19-4t??i 8 

20 0+ 25,5 
zu - u _ 14 .5 

18.8++ 

0 o+ 18 -° 

U - U -5.0 

19 . 4 ++ 

10 o + 22 - 3 

10 . 2 ++ 

Total Fraction 


42% (337) 

29% (233) 

4% (36) 

18% (147) 

6% (52) 


Note: Visual classification from Kartaltepe et aI7| ( ^20 14) for UDS and GOODS-S (no classifications for COSMOS galaxies). For a galaxy to be visually 
classified 2/3 observers need to agree. ‘Other’ classification refers to galaxies failing the 2/3 agreement requirement. The left hand columns for each visual 
classification represent the demographics based upon the original group based on hierarchical clustering. The right hand columns are based on the total 
group probabilities based on the scattering technique classifications. 

classify this group as disk galaxies and/or irregulars. Group indices (h ~ 0.63) imply a large disk and irregular popula- 

1 is dominated by visually-classified disks (72 percent) and tion. 

has a relatively large fraction of irregulars (16 percent). This 
group has very few spheroids or bulge-dominated disk galax¬ 
ies. Their effective radii are large for this redshift and mass 
(r e ~ 5.35 ± 1.43 kpc). This group has low average Sersic 
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Group 2 

Group 2 galaxies are primarily low-mass , star-forming, 
smooth disk galaxies with high central concentrations and 
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Table 4. Demographics of Sersic Classifications of Groups 



Group 

0 < n < 1 

1 < n < 2.5 

2.5 < 

n < 4 

n 

> 4 


6 

36.61?® 

13.31!;! 

19.5l|? 

01 9 +O .9 
O 1 * z -0.4 

99 1 +2.9 
zz>± - 2.6 

34.61!® 

13.41?;? 

20.91!® 

Low PCI 

0 

30.7!?;® 

6 . 1 !?:? 

18-llH 

25.0I?? 

is.ill? 

28.41?'? 

25.31?;! 

40.51?-? 


9 

53.2l “ 8 7 

30.olg;J 

26.61?;! 

38.ol?;? 

9 c+4.6 
z -°-2.4 

1 9 7 + 2.0 

LZ -‘-0.7 

7-611.1 

19.31?;? 



37.0t°j 

i6-3li;! 

19.91°;! 

32.0I!; 7 

19.0I!;? 

28.41?;® 

15.61!;? 

23.31?;? 

Mid PCI 

4 

47.71?? 

23.21?-? 

25.21 7 ' 2 

43.71?? 

8 -il|;l 

I6.0I?? 

10.81?;! 

I7.ll?;? 

8 

60.01!;? 

38.61?;? 

20 . 0 l?;? 

37.41?;? 

4 4 + 3-6 
^-2.3 

13-21?;! 

3.0l?;? 

10.81?;® 



54 4+ 1 - 3 

31.21®;! 

22 4+ 12 

40.51?;? 

e.ilo.l 

14.611® 

c c + 1-2 

O-O_o.4 

13.81?;? 


1 

00 9 + 12.0 
— 10.6 

68.71?-? 

0 9 + 3.8 

2.2 

12.61?? 

0 . 0 l?;? 

0 0 + 2-2 

8.8_o7 

1 1+3-1 

9 9+ 2 ’2 
y,y -0.7 

High PCI 

2 

71 4+ 15 - 8 
' 13.1 

39.21?! 

12.91?? 

33.91?? 

1 4 +5 - 4 
1 , 4 -2.3 

14 q+3-0 

1 4+ 5 - 4 

1 '^-2.3 

12 6+^° 
lZ.O_o.9 


5 

84-i±iS:i 

55.2l?;i 

4.7l?;? 

16 5+ 2-9 
lO.O _ 0i9 

0 0+ 2,7 
U ' U - 0.8 

14.51?;? 

fi fi + 2 ^ 
u,u —0.8 

13.91?;® 



80.91J;! 

55.71?;® 

5 Q + l-l 

°- y -0.3 

20.21?;? 

f) q+!-0 
u -' 3 -0.3 

12 . 2 I?;! 

o.7l?;! 

12.01?;! 

Outliers 

-1 

78.9l^;I 

60.8l? 2 6 2 

fi n+ 180 

U - U -5.0 

19.6l? 2 2 ° 

fi f)+ 18 -° 
U - U -5.0 

O.Ol ? 1 ! 8 

15.81?!;? 

19.5l? 2 2 ° 

Total Fraction 


56% (630) 

19% (213) 

13% (150) 

12% (131) 


Note: The left hand columns for each Sersic-index range represent the demographics based upon the original group based on 
hierarchical clustering. The right hand columns are based on the total group probabilities based on the scattering technique 
classifications. Due to the small sizes of certain galaxies, not every galaxy has a measured Sersic fit. 


Table 5. UV J Quenched Fractions of Groups 



Group 

Quenched 

Star-Forming 


6 

43.51?-? 

39.3l|;? 

56.511;! 

60.5!?;? 

Low PCI 

0 

25.91?;? 

27.0l®;| 

74.1!?;! 

73.0l?'y 


9 

76!?;? 

15 1 +3-5 
-*- 0-1 — 2 .9 

92.4!??;® 

84.91?;? 



35.4!!;? 

o-t 0 + 1-0 
°- l -°- 0.8 

64.6!!;? 

68.5!?;? 

Mid PCI 

4 

8 

15 3+ 4,5 
io.o_3 6 

0 1 +3.1 

8 ' i -2.4 

17.2!?-® 

13.9l|l 

84.71?;? 

91.9!?;! 

82.8+1? 

86.it!;! 



11 4 + 0.8 
li -^- 0.2 

15.5!?;? 

88.61!;® 

84.5!?;? 


1 

O-Olo'.s 

0 q+3.1 
o-o_ 2.3 

100.0l?° 8 8 

91.71?-? 

High PCI 

2 

O.Ol?;® 

15.0!?;? 

100.ol??? 

84.91®;? 


5 

0 9+ 21 
u - y -i.o 

13.9!?;? 

99.l!?°2 1 

86.ll!;! 



o-4l!;? 

i2.il?:? 

99-61!;! 

87.8!?;? 


-1 

10.5l? 3 6 4 

14 0+15.7 
A 4 * z -9.3 

89.5!?®;? 

85.81??; 7 

Total Fraction 


23% 1 

(281) 

77% (962) 


Note: Quenched/star-forming classifications based on Fig. [7] The left hand 
columns for quenched/star-forming classifications represent the demographics 
based upon the original group based on hierarchical clustering. The right hand 
columns are based on the total group probabilities based on the scattering tech¬ 
nique classifications. 


few visually detected star-forming knots (example postage 
stamps in Fig. 19). None of these galaxies are quenched. 
The mass distribution for this group is a steeply declining 
function where there are only a few galaxies with masses 
> 3 x 10 lo M e . 


Group 2 galaxies have large concentrations (C ~ 4.81 
± 0.62), low Gini coefficients (G ~ 0.45 ± 0.04 ), moderate 
M 20 (~ -1.20 ± 0.24), low asymmetry (A ~ 0.06 ± 0.08), 
low deviations (D ~ 0.16 ± 0.09), moderate multi-modes 
(M ~ 0.16 ± 0.21), and a wide spread of intensity values 
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(/ ~ 0.29 ± 0.29). On the G — M 20 diagram these galax¬ 
ies fall within the disk-dominated and irregular portion of 
the diagram. However, their high C values suggest a bright 
nuclear component. 

The visual classifications show this group is dominated 
by disks (76 percent) and only small fractions of irregular 
galaxies (~5 percent) and disk+spheroid galaxies (~ 8 per¬ 
cent). They have mid-sized effective radii (r e ~ 3.52 ± 0.83 
kpc) and mid-to-low average Sersic indices (h ~ 1.10). 


Group 5 

Group 5 galaxies are primarily low-mass (M* < 3x 1O 1O M0), 
star forming, extended disk galaxies with a weak bulge com¬ 


ponent (example postage stamps in Fig. 201. This group has 
a negligible fraction of quenched galaxies (~ 1 percent). 

Group 5 is mostly comprised of low concentration val¬ 
ues (C ~ 2.87 ± 0.42), low Gini coefficients (G ~ 0.40 ± 
0.03), low/moderate M 20 (~ -1.20 ± 0.19), a wide spread in 
multi-mode (M ~ 0.26 ± 0.24), large intensity values (I ~ 
0.52 ± 0.28), low deviation values (D ~ 0.12 ± 0.06), and 
low asymmetry values (A ~ 0.03 ± 0.12). On the G — M 20 
diagram these galaxies fall solidly within the disk-dominated 
region. 

The defining feature of this group is its large typical 
size (r e ~ 5.47 ± 1.81 kpc). Group 5 galaxies have very 
low average Sersic indices (n ~ 0.65); implying a disk- 
dominated/irregular population. 

Visual classification indicate group 5 is comprised al¬ 
most entirely of disks (95 percent), and a few irregulars 
(3 percent). This group has no visually identified bulge- 
dominated or spheroidal galaxies and are not clumpy. 


Group -1 


The original groups 3 and 7 were comprised of only a few 
galaxies each (19 in total, example postage stamps in Fig. 
21 ). They were outliers from the remaining groups and are 
combined into a single outlier group. These galaxies are 
most likely outliers because they have at least one poorly 
measured (or missing) morphological parameter (especially 
the multi-mode statistic). These galaxies have a low surface 
brightness, very large radii (r e ~ 6.73 ± 2.30 kpc), low con¬ 
centration (G ~ 2.21 ± 0.74), high intensity (/ ~ 0.44 ± 0.39 
), high M 20 (~ -0.99 ± 0.26), low Gini coefficient (G ~ 0.41 
± 0.10), extremely high deviations (D ~ 0.69 ± 0.49) and 
high multi-modes (M ~ 0.53 ± 0.39). The deviation values 
can separate group -1 galaxies from all the other groups. 


6 DISCUSSION 

The spatial distribution of light for galaxies is a snapshot of 
the orbital paths of the constituent stars, gas, and dust. The 
morphology of a galaxy informs us of the merger and gas- 
accretion history in ways integrated colors, spectral-energy 
distributions and stellar mass cannot directly probe. 



Figure 12. Cumulative quenched fraction rank ordered by var¬ 
ious metrics: PCI, PC2, PC3, stellar mass, Sersic-n and “com¬ 
pactness” . The green solid line represents no correlation between 
quenched fraction and rank. Sersic-n and PCI have a similar CQF 
shape, where n is less contaminated by quenched galaxies at low 
values but PCI is less contaminated at high values. 


Using a Sersic index, bulge-dominated galaxies are tra¬ 
ditionally defined to have n > 2.5 (e.g. [Bruce et al.| [2014a|). 
For the purposes of of our PC classifications we define 
galaxies with low PCI values as bulge-dominated (the con¬ 
stituents of groups 0, 6 and 9). These two definitions lead 
to differences in the characteristics of what are defined as 
‘bulge-dominated’ and we will explore these differences in 
the following sections. 

The connection between morphology and star- 


formation has been well studied (Wuyts et al. 2011 Kriek 
|et al.||2009| |Brinchmann et al.||2004| ). Late-type galaxies 
are typically still actively forming stars, whereas early-type 
galaxies have had their star-formation quenched. However, 
there are examples of red, quenched disks and blue, star¬ 
forming ellipticals which are important rare “transitional” 
classes. In our study we delve deeper into the correlations 
between morphological type and star-formation and how 
the connection between them is not always clear-cut. 


6.1 Stellar Mass - Quenching Connection for 
groups 

Fig. [12] shows the cumulative distribution of the quenched 
fraction rank-ordered by “compactness” (M/rf 5 < 10.3 Mg 


kpc , Barro et al. 2013), stellar mass, Sersic-n, PCI, PC2 


and PC3. For every galaxy we assign a 0 to star-forming 
galaxies and 1 /n quenc hed for quenched galaxies (as deter¬ 
mined by the UV J diagram, Fig. [7| and then these values 
are cumulatively summed. We observe a flat trend in stel¬ 
lar mass and PC2, and a much steeper trend in PCI and 
Sersic-n. However, Sersic-n has previously been shown to 


correlate well with quenching (e.g. Wuyts et al. 2011 Bell 
et al.||2012 1 . The similarities in steepness between the PCI 
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and Sersic-n curves show PCI is an equivalently useful pre¬ 
dictor of quenching. 

We also investigate the relationship between quench¬ 
ing and PCI through the color-mass relation. In Fig. [8] we 
observe a correlation between the increase in the fraction 
of massive galaxies (> 5 x 10 1o Mq) for a specific group 
and the magnitude of PCI (bulge strength). The amount of 
quenched galaxies correlates more strongly with PCI (bulge 
strength) than PC2 (concentration). Similar results have 
been found for z ~ 1-2 galaxies (Bell et al. | [ 2012] |Barro| 
et al.|20l3||Lang et al.|2014 l. Unsurprisingly, the most mas¬ 


sive galaxies are also the most likely to be quenched. For 
instance, group 6 has the largest amount of red galaxies 
and many massive galaxies (> 5 x 10 1o Mq). The only other 
groups with a substantial number of quiescent galaxies are 
groups 0 (26 percent) and 4 (15 percent). Group 0 galaxies 
are primarily bulge-dominated with a faint disk. However, 
group 6 galaxies are more massive (> 5 x 10 1o Mq) than 
group 0 galaxies. Furthermore, a much larger percentage of 
these massive galaxies in group 6 are quenched. Group 9 
galaxies are slightly less massive (log M* ~ 10.4), but still 
generally have a strong bulge component (as determined by 
PCI). 

Group 4 and 8 galaxies fall between the extremes of the 
bulge-dominated groups (0, 6 and 9) and the disk-dominated 
groups (1, 2 and 5) in stellar mass, bulge strength and 
quenched fraction (see Tables |2]-[5|. The galaxies of groups 
4 and 8 are more bulge-dominated than the disk-dominated 
galaxies which would explain the larger quenched fraction. 
Groups 4 and 8 galaxies are not as massive as those in the 
bulge-dominated groups 0, 6 and 9 (Tabl e [2| and are not 
quenched to the same extent either (Table |5| . 


6.2 The relationship between PCA Classes and 
Visual/Sersic Classifications 

PCA, in conjunction with our group finding algorithm, pro¬ 
vides a distinct picture of galaxy structure from Sersic in¬ 
dex and visually based classifications. This classification 
scheme also separates quenched compact galaxies (group 6) 
from larger, smooth proto-elliptical systems (group 0), and 
star-forming disk-dominated clumpy galaxies (group 1) from 
star-forming bulge-dominated asymmetric galaxies (group 
9). Separating clumpy star-formers and bulge dominated 
star forming galaxies has great importance for understand¬ 
ing the mechanisms that formed these galaxies and the po¬ 
tential avenues for evolution available to them. 

Based upon the visual and Sersic classifications, our 
groups belong to 3 distinct types: the “disk-dominated” 
galaxies of groups 1, 2, and 5; the “compact/bulge- 
dominated” galaxies of groups 0, 6, and 9; and the “inter¬ 
mediate” galaxies of groups 4 and 8. For the purposes of our 
discussion we refer the reader to Figs. [7] - m the example 
galaxies of Figs. [T3]-[M] and Tables [2] - [5] 


6.2.1 The Compact and Bulge-Dominated Galaxies: 
Groups 0, 6 and 9 


Galaxies in groups 0,6, and 9 display a variety of visual clas¬ 
sifications, but have a single unifying characteristic: many 
of these galaxies are bulge-dominated. Group 6 galaxies are 
very small and compact (r e ~ 1.57 ± 0.81 kpc) with no 
discernible stellar envelope. Group 0 galaxies are slightly 
larger (r e ~ 3.13 ± 1.97 kpc) than group 6, and display 
evidence for an extended stellar envelope. Groups 0 and 6 
display some distinguishing characteristics as well. Group 6 
galaxies lower measured concentrations (C ~ 3.04 ± 0.40) 
than those in group 0 (C ~ 3.80 ± 0.78). The small sizes 
and lower concentrations for group 6 galaxies are due to the 
fact that r 2 o measurements are near or below the resolution 
limit of the survey. Additionally, r so measurements are very 
small for group 6 compared to galaxies in all other groups 
(see Appendix 0 - 


The size-mass (Fig. 11 1 relation for these two groups is 
different as well. Group 6 galaxies are smaller but overlap 
in masses with group 0 galaxies. Thus many more group 6 
galaxies are compact using the Barro et al. (2013) defini¬ 
tion. Compact galaxies in group 6 are quenched, whereas 
the quenched galaxies of group 0 are more extended. 


Group 6 galaxies are visually classified as bulge- 
dominated (either pure spheroid or disk+spheroid morphol¬ 
ogy) >78 per cent of the time. However, a Sersic cut of 
n > 2.5 yields only 35 per cent. Similarly for group 0 
galaxies, 66 per cent of galaxies are visually classified as 
bulge-dominated, but a Sersic classification only indicates 
43 per cent are bulge-dominated galaxies. Meanwhile, group 
9 galaxies are the most visually disturbed group (26 per 
cent irregular) and have bright central bulges determined 
by PCI. 


Classifications based on PCs provide a slightly differ¬ 
ent picture from those based on Sersic-n or visual inspec¬ 
tion. A PC classification determines ~57 per cent of galax¬ 
ies are compact/bulge-dominated (groups 0, 6 and 9) while 
visual classifications determine ~47 per cent of galaxies are 
bulge-dominated (either pure spheroids or disk+spheroids) 
and Sersic indices classify ~25 per cent of galaxies as bulge- 
dominated (n > 2.5). The differences between the classifi¬ 
cation schemes are subtle but important because they mean 
each is probing a slightly different subset of galaxies. 

The compact/bulge-dominated nature and high masses 
of these 3 groups could imply an evolutionary connection. 
In this scenario, galaxies begin as group 6 galaxies, a naked 
core with no extended envelope or structure. Following a 
gas-rich merger disturbed tidal features become visible and 
the galaxy becomes classified as group 9. After a sufficient 
time for the gas to settle in a disk or spheroidal envelope 
(> 1.5 Gyr) the galaxy would appear as a group 0 galaxy. 
In this scenario, the quenched galaxies of group 6 have star- 
formation reignited following the merger, only to once again 
fade during the disk settlement period. Mergers would thus 
be a major mechanism for triggering disk growth. 


© 2015 RAS, MNRAS 000, [TlfTt] 











PC A of CAN DELS Morphology 21 


6.2.2 The Disk-dominated Galaxies: Groups 1, 2 and 5 

Groups 1, 2 and 5 all have an overwhelmingly large per¬ 
centage of visually classified disk galaxies (72 per cent, 76 
per cent, and 96 per cent respectively). Sersic classifications 
largely agree with the visual classifications for these groups. 
The only difference is that Sersic classifications yield more 
disk-dominated galaxies (1 < n < 2.5) than visual classifi¬ 
cations would indicate. Non-parametric morphologies deter¬ 
mine these disk galaxies have varying degrees of dumpiness 
and disturbances. 

Group 1 galaxies are the most disturbed of the “disk- 
dominated” groups. They have the largest asymmetries 
(A ~ 0.13 ± 0.11), multi-modes (M ~ 0.40 ± 0.27), in¬ 
tensities (/ ~ 0.61 ± 0.24) and deviations (D ~ 0.37 ± 
0.13). They are more often visually classified as irregular (16 
percent), but have a weaker bulge component (indicated by 
their larger A/20 values, ~ -1.07 ± 0.17) than groups 2 and 
5. 

Of the remaining disk-dominated groups, group 5 galax¬ 
ies have much higher M and I statistics ( M ~ 0.26 ± 0.24 
and / ~ 0.52 ± 0.28) than those in group 2 (M ~ 0.16 ± 
0.21 and / ~ 0.29 ± 0.29). However, these two groups have 
similar asymmetry values {A ~ 0.05), A/20 values (~ -1.2), 
and deviations (D ~ 0.1). 

The disk-dominated galaxies of groups 1, 2 and 5 are 
on average less massive, bluer in U — V — J and larger than 
the compact/bulge-dominated galaxies of groups 0, 6 and 9. 


6.2.3 The Intermediate Galaxies: Groups 4 and 8 

Groups 4 and 8 represent an intermediate PC class between 
the compact/bulge-dominated morphologies of groups 0, 6 
& 9 and the disk-dominated groups 1,2 & 5. Group 4 and 8 
both have a population of quenched galaxies. However, the 
quenched galaxies of group 8 are smaller than those of group 
4. 

Both groups 4 and 8 have a large fraction of galaxies 
with n < 2.5 (72 per cent and 80 per cent, respectively). 
However, group 8 galaxies are more likely to be visually 
classified as disks than group 4 galaxies (74 per cent com¬ 
pared to 51 percent). Meanwhile, group 4 galaxies are more 
likely be visually classified as bulge-dominated (41 per cent 
compared to 15 percent). However, the differences between 
groups should taken with caution as the small numbers of 
galaxies in these groups reduces the significance of the per¬ 
centages. 

For groups 4 and 8 the classifications based upon non- 
parametric morphologies do not always agree with classifi¬ 
cations based on Sersic indices or visual inspection. Group 
8 has a much smaller average M20 value (M20 ~ -1.56 ± 
0.17) than group 4 (A/20 ~ -1.11 ± 0.24). This indicates the 
bulges of group 8 galaxies are large and possibly dominate 
the morphology. However, Sersic indices and visual classifi¬ 
cations would suggest there is no sizable bulge component 
for most of these galaxies. Group 4 galaxies have high con¬ 
centrations, low Sersic indices and are the least well defined 
group by bootstrap measures (see Fig. [6]). Meanwhile, the 


G — A/20 diagram suggests a population of irregular galax¬ 
ies while visual classifications find no irregular galaxies. The 
bright nuclear components may be the result of an AGN or 
starburst activity. 


6.2.4 Comparing the Irregular Galaxies of Groups 1 and 9 


The galaxies of groups 1 and 9 are the most likely to be 
classified visually as irregular. While group 1 is defined by 
star-forming disk-dominated clumpy galaxies, group 9 is de¬ 
fined by star-forming bulge-dominated asymmetric galaxies 
with tidal features. These subtle morphological differences 
are missed by Sersic index, C — A and Gini — M 20 based 
classifications and potentially offer clues as to the formation 
and evolutionary tracks of these galaxies. 

Group 9 galaxies display tidal features and irregular 
disks but their strong central bulge is missed by Sersic fits. 
Group 9, itself, shows the power of our PCA classifications 
to find interesting subtypes of galaxy morphology. Group 9 
galaxies are visually classified as disks (41 percent), irreg¬ 
ulars (23 percent) and bulge-dominated disks (12 percent). 
However, small PCI values would indicate group 9 galaxies 
posses a strong central bulge. Meanwhile, group 1 galax¬ 
ies much more likely to be visually classified as a pure disk 
galaxy (72 percent), slightly less likely to be irregular (16 
percent) and are not bulge-dominated (0 percent). Group 1 
galaxies also have higher PCI values, indicating a weaker 
bulge component. Using Sersic index classifications, both 
groups 1 and 9 have a very large fraction of these galaxies are 
disk-dominated (85 percent) as opposed to bulge-dominated 
(15 percent). Groups 1 and 9 would be considered very simi¬ 
lar in a Sersic classification and the differences between these 
groups are more subtle. 

We observe subtle differences between these two groups 
in many statistics; group 9 galaxies are more asymmetric 
(0.22 ± 0.10 vs. 0.13 ± 0.11) and have lower A/20 values 
(-1.40 ± 0.27 vs. -1.07 ± 0.17) than galaxies found in group 
1. Group 9 galaxies are also more concentrated (3.70 ± 0.70 
vs. 2.76 ± 0.40). Meanwhile A/, I and D statistics all display 
an increased enhancement in group 1 galaxies because these 
statistics probe the existence of off-center clumps. 

Based on these differences it is possible these two types 
of galaxies have experienced different formation scenarios 
or exist at different stages along their evolution. Group 9 
galaxies have a large central bulge which could be the result 
of either a merger or the accretion of many star-forming 
clumps in the disk. Meanwhile, group 1 galaxies are still 
clumpy and have small central bulges. Different levels of the 
amount of violent disk instabilities (VDI; Dekel et al.| 2009 


Guo et al. 20151 is a possible explanation for the segrega¬ 


tion of groups 1 and 9. Group 9 galaxies have a larger bulge, 
possibly grown by the migration of clumps to the central 
galaxy regions following repeated VDIs. Meanwhile, group 
1 galaxies, which still have bright clumps in the disk (as evi¬ 
denced by enhanced MID statistics) have yet to experience 
as many VDIs and thus the central bulge remains smaller. 
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7 SUMMARY 


We use a principal component analysis of non-parametric 
morphology measurements (G, A/20, C, A, M, I and D) and 
agglomerative hierarchical clustering to group galaxies into 
a more descriptive schema than the traditional spiral, ellipti¬ 
cal, and irregular categories. The PCA weights we calculate 
(Table [l]) show that non-parametric morphological correla¬ 
tions vary in importance: PCI is based upon M ,/ ,D,M 2 o 
and Gini thus it is interpreted as a bulge strength indicator; 
PC2 is dominated by concentration; and PC3 is dominated 
by asymmetry; the remaining PCs are less important and 
difficult to interpret. 

The size-mass relation is dependent on PCI and PC2. 
Galaxies with high PCI values (stronger bulges) are gener¬ 
ally more compact and quiescent than galaxies with high 
PC2 values. We determine PCI is a valid predictor of 
whether a galaxy is quenched. 

We observe segregations of galaxy morphology by group 
and describe those results as follows: 


• Compact or Bulge-dominated/low PCI, 
per cent 


57 


- Group 6: Most populated group (~ 37 per cent 
of sample, examples seen in Fig. |13| . Very compact and 
most massive galaxies; and contains the largest spheroidal 
(based on Sersic and visual classifications) and quenched 
fraction. 

- Group 0: Large bulge+disk population, has promi¬ 
nent bulge with faint disk component. (~ 13 per cent, Fig. 
14). Contains a sizable fraction of massive and quenched 
galaxies, not to the same extent as group 6 however. 

Group 9: Large and massive galaxies with a substan¬ 
tial irregular population. Visually, these galaxies posses 
tidal tails, bright star-forming knots and a large bulge. 
(~ 6 per cent, Fig. [I5| 

• Bulge+Disk/intermediate PCI, ~20 per cent 

Group 4: Smaller and less massive bulge-dominated 
disk galaxies with high Gini, Sersic index and concentra¬ 


tion values. (~ 9 per cent, Fig. 161 

Group 8: Slightly larger bulge+disk systems. (~ 11 
per cent, Fig. 171 


Disk-dominated/high PCI, ~ 22 per cent 

Group 1: Large galaxies with prominent (albeit) ir¬ 


regular disks. (~ 8 per cent, Fig. 181 


- Group 2: Compact and small disks galaxies. (~ 6 
per cent, Fig. 191 


- Group 5: Large and low mass disk galaxies with 
evidence of disturbances and interactions. (~ 9 per cent, 


Fig. 20) 


• Group -1: Low surface brightness galaxies (~ 1 per 
cent, Fig. |21[ ) with outlier PC values. 

The PC classification scheme separates quenched com¬ 
pact galaxies from larger, smooth proto-elliptical systems, 
and star-forming disk-dominated clumpy galaxies from star¬ 
forming bulge-dominated asymmetric galaxies. Additionally, 


classifications based on PCs provide a different picture from 
those based on Sersic-n or visual inspection. A PC classi¬ 
fication determines ~51 per cent of galaxies are compact 
or bulge-dominated (groups 0, 6 and 9) while visual clas¬ 
sifications determine ~39 per cent of galaxies are bulge- 
dominated (either pure spheroids or disk+spheroids) and 
Sersic indices classify ~20 of galaxies as bulge-dominated 
(n > 2.5). 

In the future we will extend our PCA classifications 
to different redshifts. We will use the classifications defined 
here to study the evolution of star-formation for a variety 
of morphological types. Star-formation can be quenched in 
many ways and with a reliable morphology classification 
for different epochs we can begin to answer the question: 
whether star-formation quenching is occurring at the same 
time as the bulge is forming? A temporal connection be¬ 
tween these two could have important consequences on how 
galaxies have been quenching star-formation. 
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APPENDIX A: GALAXIES WITH FLAG=1 

Galaxies with non-contiguous segmentation maps receive a 
FLAG=1 designation. The disconnected segmentation maps 
could be the result of a few factors: the light of a nearby 
bright galaxy encroaching on a galaxy, low surface brightness 
or low signal-to-noise. For this reason their non-parametric 
morphology measurements are likely to be unreliable. Fig. 
|A1| is the normalized histogram of magnitudes for galaxies 
with either FLAG=0 or FLAG=1. We also show the fraction 
of galaxies per magnitude bin. The number of galaxies with 
FLAG=1 galaxies as a fraction of all galaxies increases up to 
magnitude 24.5, which is the brightness limit of the survey. 
For these reasons we leave these galaxies out of our sample 
in this work, but we will investigate these galaxies in a future 
work. 


APPENDIX B: CONCENTRATION - SERSIC 
INDEX RELATIONSHIP 


Andrae et al. (2011) demonstrated the correlation between 


concentration and Sersic-n. However, this relationship does 
not appear to hold for our high redshift sample. Fig. |B1| 
shows a less established relationship for concentration and 
Sersic-n in our galaxy sample. We show that concentration 
is biased low for very small (r e < 2 kpc), high Sersic n galax¬ 
ies (n > 2.5) which represents ~14 per cent of our sample. 
We also find many z~1.5 galaxies with high concentration 
and low Sersic-n that deviate from the Andrae et al. (2011) 
relation and are not easily explained by measurement bias. 

The PSF for F125W has a full width half-maximum 
(FWHM) of ~0.135”. For many galaxies, r 2 o is smaller than 
the PSF (and in some cases even r e is smaller than the PSF). 
Fig. |Bl| shows that high Sersic galaxies make up some of the 
smallest objects in our sample. These small galaxies can have 
7*80 ~ 0.48”, which is only a few times larger than the PSF. 

We wish to test the effect of the size of the PSF can 
have on measuring the concentration index, particularly for 
small galaxies. To accomplish this we take a pure Sersic sur¬ 
face brightness light profile I ~ exp[(r/r e ) 1/,n ] with r e = 10 
kpc and calculate the Petrosian Radius (Eq.[lJ, rgo, r 2 o and 
thus concentration. We convolve the pure Sersic profile with 
a gaussian with the same FWHM as the PSF. This convolu¬ 
tion has little effect on the concentration for large galaxies. 
However, we noticed in Fig. |B1| that many galaxies have 
very small r e values which could lead to why concentration 
values are lower than anticipated. To test this hypothesis 
we convolve the Sersic surface brightness profile of a small 
galaxy (r e = 1 kpc and 2 kpc) with a gaussian with the 
FWHM of the PSF. This will allow us to observe the effect 
of convolving the surface brightness profile of a small galaxy 
with a PSF of comparable size. 

Fig. |B2| shows the concentration - Sersic relation present 
in our galaxy sample and is color coded by the ratio of the 
size of the PSF to the effective radius of a galaxy. The solid 
red line in Fig. |B2| shows the relation between concentration 
and Sersic calculated for a pure Sersic surface brightness 


profile with r e = 10 kpc (first demonstrated in Andrae et al. 


2011). The thin-thick and thick dashed lines in Fig. B2 show 


the concentration - Sersic relation for a surface brightness 
profile (of a r e = 1 kpc or 2 kpc galaxy) convolved with a 
gaussian with the FWHM of the PSF. Galaxies with high 
FWHM/Re ratios (i.e. the galaxy has a comparable physi¬ 
cal size to the PSF) fall noticeably below the concentration- 
Sersic relation for a pure Sersic surface brightness profile. 
The flatter concentration-Sersic relation of the small galaxy 
surface brightness profiles convolved with the PSF closely 
follows the concentration and Sersic values we measure for 
our sample. As the physical size of a galaxy decreases the 
concentration values are increasingly depressed. We take this 
as evidence that small galaxies (those with physical sizes 
similar to the PSF, r e ~ 1-2 kpc) are most affected by the 
PSF. Thus the reason the concentration values for our galax¬ 
ies are smaller than the relation of Andrae et al. (2011) is 
likely due to the small physical sizes of many galaxies in our 
sample. 

Up to 14 per cent of our total sample maybe be quite 
small (roughly the size of the PSF, r e < 2 kpc) and have 
a high Sersic index (n > 2.5) leading to an artificially de¬ 
pressed concentration value. Many of these galaxies (~80 
per cent) are in group 6. After correcting the concentration 
values these galaxies would instead be classified into group 
0. This suggests that a portion of the group 6 galaxies would 
instead be group 0 if we had higher resolution images. How¬ 
ever, this implies only ~26 per cent of all group 6 galaxies 
would be reclassified as group 0 so there is still a notable 
distinction between these two groups. 
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Figure 13. Group 6 F125W 1.36 < z < 1.97 galaxies, shown in F160W/F125W/F814W RGB 6”x6” postage stamps. p(Group) represents 
the percentage of times a galaxy is classified into group 6 after the scattering test. Very compact and small spheroidal galaxies. This 
group contains the largest spheroidal and quenched fraction. Many of these galaxies are barely resolved which leads to their structureless 
appearance. 
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Figure 14. Group 0: These galaxies are characterized by a strong bulge component surrounded by a fainter smooth disk. 
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Figure 15. Group 9: These galaxies are characterized by their asymmetric, irregular morphologies and strong bulge component. 
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Figure 16. Group 4: These galaxies consist of low-mass smooth galaxies with moderate central concentrations. 
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Figure IT. Group 8: These galaxies represent class of bulge+disk systems with dominant smooth disks. 
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Figure 18. Group 1: These galaxies are primarily large disks and irregulars with bright off-center star-forming knots. 
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Figure 19. Group 2: These galaxies appear to be primarily low-mass star-forming disk galaxies with higher central concentrations and 
few detected star-forming knots. 
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Figure 20. Group 5: Many of these galaxies are low-mass extended star forming disk galaxies with weak (if any) bulge components. 
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Figure 21. Group -1: Low surface brightness galaxies originally in groups 3 and 7, and only have a combined 19 galaxies which are 
outliers from all other groups. 


© 2015 RAS, MNRAS 000, [T][3T| 



















30 M. Peth et al. 



Figure Al. Histogram of F125W J-band Magnitude for galaxies 
with FLAG=1 and FLAG=0 and a plot of the fraction of all 
galaxies with FLAG = 1 designation per magnitude bin (black 
dashed line). 
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Figure Bl. WFC3 125W measured concentration versus F125W Sersic index ( |van der Wei et al.|2012) color coded by ( left panel) Rgo 
and ( right panel) R 20 for the entire sample. Our z ~1.5 galaxies generally follow a shallower relation than previously shown in |Andrae| 
|et al.|201l| (black dashed line). 



Figure B2. Concentration versus Sersic index relation color- 
coded by the ratio of the PSF FWHM (~0.135”) to the effective 
radius of a galaxy. We plot the numerically defined relationship 
for a pure Sersic profile ( [Andrae et al.|[201 l|, red line) and the 
corrections to the pure Sersic profile when the PSF FWHM is 50 
per cent the size of a 2 kpc galaxy ( thick-thin dashed line) and 
when the PSF is as large as a 1 kpc galaxy ( thick dashed line). 
The relative size of the PSF to a galaxy has a large impact on 
the concentration values for galaxies with higher Sersic indices. 
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